For a unified analysis of phase estimation, we focus on the limiting distribution. It is shown
that the limiting distribution is given by the absolute square of the Fourier transform of a
function whose support is contained in [-1, 1]. Using this relation, we study the relation between the
variance of the limiting distribution and its tail probability. As our result, we prove that the protocol
minimizing the asymptotic variance does not minimize the tail probability. Depending on the width
of the interval, we derive the estimation protocol minimizing the tail probability outside a given interval.
Such an optimal protocol is given by a prolate spheroidal wave function, which often appears in
wavelet or time-limited Fourier analysis. Also, the minimum confidence interval is derived within the
framework of interval estimation that assures a given confidence coefficient.
I. INTRODUCTION

Estimating or identifying an unknown unitary operator is discussed in both quantum computation [3, ^2] and quantum statistical inference; it is therefore a fundamental topic in quantum information. In quantum computation, the unknown unitary operator is given as an oracle, and the question is how many applications are required to identify the given oracle with a given precision. In quantum statistical inference, in contrast, many researchers optimize the average fidelity or the mean square error between the true unitary and the obtained guess. In both fields, the input state to the unknown unitary is optimized as well as the measurement, and a quadratic speedup is reported in both [1, [1, [1,111 . However, the two fields discuss the same topic under different criteria, and there is an example in which the appearance of the quadratic speedup depends on the choice of the criterion. Since the relation between the criteria is not clear, the problem needs to be treated within a common framework.

In the present paper, as the most typical example, we focus on phase estimation, in which the quadratic speedup was demonstrated experimentally [7]. Kitaev first treated the phase estimation problem from the viewpoint of quantum computation [8]. Since it appears in Shor's factorization, it is considered a fundamental topic in quantum computation as well as in physics. In order to treat the quadratic speedup more deeply, we focus on the limiting distribution, which describes the stochastic behavior of the estimate in the neighborhood of the true parameter. That is, it is the distribution of the random variable $n(\hat\theta - \theta)$ when the estimate, the true parameter, and the number of applications are given as $\hat\theta$, $\theta$, and $n$. While the limiting distribution is a common concept in statistics and was studied in the estimation of quantum states [O, , it has not been studied systematically in the estimation of an unknown unitary operator.

The concept of 'limiting distribution' is useful in the following four respects. First, the variance of the limiting distribution gives the asymptotic first-order coefficient of the mean square error. Second, the tail probability of the limiting distribution for a given interval provides the tail probability of the interval when the width of the interval behaves as the order $\frac{1}{n}$. Third, using the limiting distribution, we can discuss phase estimation under the framework of interval estimation, whose meaning is explained later. Fourth, using this concept, we can treat the number of applications required for attaining a given accuracy, i.e., a given error probability and a given error bar, in the asymptotic framework. That is, the above four advantages correspond to respective criteria. Therefore, the limiting distribution provides a unified framework for these criteria. The first three criteria are familiar in statistics, and the fourth criterion is familiar in computer science.

In the present paper, we analyze the limiting distribution in phase estimation systematically, and show that the limiting distribution is expressed by the Fourier transform of a square integrable function on the closed interval $[-1,1]$, which approximately gives the input state in the asymptotic setting.

In a realistic setting, the optimization of the fourth criterion is more appropriate than that of the first criterion, i.e., the optimization of the mean square error or the average fidelity. However, in the fourth formulation only estimators with several special input states have been treated, and their optimization has not been discussed, whereas the optimization of the mean square error and of the average fidelity has been carried out in existing research.

In statistics, this problem is treated by interval estimation, in which the inference is given as an interval. Indeed, there are two formulations in statistics: one is point estimation, in which the estimate is a single point, and the other is interval estimation. In point estimation, it is not easy to guarantee the quality of the estimate because the estimated value always has statistical fluctuation. In order to resolve this problem, in interval estimation, for given data and a given confidence coefficient, the estimate is given as an interval, which is called a confidence interval. In this formulation, a smaller width of the confidence interval is better. As is mentioned in Section II, when the number $n$ of data is sufficiently large, the confidence interval can be provided by using the limiting distribution.

Further, we analyze the variance and the tail probability of the limiting distribution. As our result, we prove that the limiting distribution minimizing the asymptotic variance does not minimize the tail probability. It is also shown that the limiting distribution minimizing the tail probability depends on the width of the interval, because the definition of the tail probability depends on the width of the interval. Such an optimal input state is given by a prolate spheroidal wave function [15], which often appears in wavelet or time-limited Fourier analysis. This function is a solution of the linear differential equation [16]:

\[ \frac{d}{dx}(1-x^2)\frac{df}{dx} + \bigl(\xi(R) - R^2x^2\bigr) f = 0. \]

Originally, prolate spheroidal wave functions appeared in the analysis of the Helmholtz equation in electromagnetics [17] and in the determination of laser modes [18]. Employing this wave function, Slepian and Pollak [1^ extended Shannon's sampling theorem to the case where the time interval is limited as well as the bandwidth, while the original Shannon sampling theorem treats the bandwidth-limited case.

Further, by using these facts, we study optimal interval estimation. We provide the estimation protocol minimizing the width of the confidence interval that assures the given confidence coefficient. In this case, the optimal estimation protocol depends on the given confidence coefficient. That means that we must prepare the input state properly depending on the confidence coefficient.

The paper is organized as follows. In Section II, the formulation of phase estimation is given and the limiting distribution is introduced with an explanation of its meaning. In Section III, we clarify the relation between the limiting distribution and the Fourier transform. In Section IV, we analyze the variance of the limiting distribution. This problem is reduced to finding the minimum eigenvalue of an operator in the Dirichlet problem. In Section V, the tail probability of the limiting distribution is discussed. It is shown that the limiting distribution minimizing the variance does not provide a small tail probability. In Section VI, we treat the interval estimation problem. This problem can be analyzed by a prolate spheroidal wave function and the eigenvalue of the defining differential equation. In Section VII, phase estimation with a single copy is discussed in the continuous system. The discussions in Sections IV and V can be applied to this formulation under the deterministic energy constraint. In Section VIII, we briefly remark on the asymptotic Cramér-Rao lower bound.

II. LIMITING DISTRIBUTION

Let us consider the estimation problem of the unknown phase shift $\theta$ with an $n$-fold unitary evolution $V_\theta^{\otimes n}$ of the unknown unitary $V_\theta$, in which our estimating protocol is given by a combination of an appropriate input state $|\phi_0\rangle$ and a suitable measurement $M$ (see Fig. 1).

FIG. 1: Our estimation scheme

As is mentioned later, this formulation essentially contains the most general framework with $n$ applications of the unknown unitary. In the following discussion, the true parameter is described by $\theta$ and our estimate by $\hat\theta$.

As is shown in Appendix A, this problem is equivalent to estimating the parameter $\theta$ of the unitary operation

\[ U_\theta := \sum_{k=0}^{n} e^{ik\theta} |k\rangle\langle k|. \tag{1} \]



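Our scheme for estimating $\theta$ is as follows. First, prepare an input state $|\phi_0\rangle = \sum_{k=0}^{n} a_k |k\rangle$, where the coefficients $a = \{a_k\}_{k=0}^{n}$ satisfy the normalizing condition $\sum_k |a_k|^2 = 1$. Second, evolve the input state $|\phi_0\rangle$ by the unitary evolution $U_\theta$. Last, perform a measurement described by a POVM $M = M(\hat\theta)\,d\hat\theta$. Then, the estimate $\hat\theta$ obeys the probability distribution

\[ P_\theta^{M,a}(\hat\theta)\,d\hat\theta := \langle\phi_\theta| M(\hat\theta) |\phi_\theta\rangle\, d\hat\theta, \]

where

\[ |\phi_\theta\rangle := U_\theta |\phi_0\rangle = \sum_{k=0}^{n} a_k e^{ik\theta} |k\rangle. \]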



When our error function is given by $R(\theta,\hat\theta)$, we optimize the mean error $D_\theta(M,a) := \int_0^{2\pi} R(\theta,\hat\theta) P_\theta^{M,a}(\hat\theta)\,d\hat\theta$. We only consider the covariant framework, i.e., the error function $R(\theta,\hat\theta)$ is assumed to be given by a function of the difference $(\theta - \hat\theta) \bmod 2\pi\mathbb{Z}$. For example, when we focus on the gate fidelity, which equals $\cos^2\frac{\theta-\hat\theta}{2}$, the error $1 - \cos^2\frac{\theta-\hat\theta}{2} = \sin^2\frac{\theta-\hat\theta}{2}$ satisfies the covariant condition.

Then, our measurement may be restricted to a group covariant measurement

\[ M_{|t\rangle}(d\hat\theta) := U_{\hat\theta} |t\rangle\langle t| U_{\hat\theta}^{\dagger}\, \frac{d\hat\theta}{2\pi}, \tag{2} \]

where

\[ |t\rangle := \sum_{k=0}^{n} e^{i\theta_k} |k\rangle. \tag{3} \]

This is because the minimum of the Bayesian average value $\min_M \frac{1}{2\pi}\int_0^{2\pi} D_\theta(M,a)\,d\theta$ under the invariant prior and the mini-max value $\min_M \max_\theta D_\theta(M,a)$ can be attained by the same group covariant measurement [Toj . Therefore, we restrict our measurement to covariant measurements in the following discussion. Then, our protocol is described by the pair of the coefficients $a$ of the input state and the vector $|t\rangle$ given in (3).

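Further, without loss of generality, we can restrict our protocol to the pair $(a, |t_0\rangle)$, where

\[ |t_0\rangle := \sum_{k=0}^{n} |k\rangle. \tag{4} \]

For any protocol $(a, |t\rangle)$, we define $a' = \{a'_k\}$ by

\[ a'_k := a_k e^{-i\theta_k}. \tag{5} \]

Then the protocol $(a', |t_0\rangle)$ has the same performance as the protocol $(a, |t\rangle)$, because

\[ \langle\phi_\theta| M_{|t\rangle}(d\hat\theta) |\phi_\theta\rangle = \Bigl|\sum_{k=0}^{n} a_k e^{-i\theta_k} e^{ik(\theta-\hat\theta)}\Bigr|^2 \frac{d\hat\theta}{2\pi} = \Bigl|\sum_{k=0}^{n} a'_k e^{ik(\theta-\hat\theta)}\Bigr|^2 \frac{d\hat\theta}{2\pi}. \]

Therefore, the choice of our protocol is essentially given by the choice of the input state. Dam et al. proved this argument in a more general framework as follows. When the number of applications is $n$, any protocol can be simulated by the above formulation. That is, any adaptive application of the unknown unitary can be simulated by the $n$-fold unitary evolution $V_\theta^{\otimes n}$ under the above error function.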
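The reduction to the measurement vector $|t_0\rangle$ can be checked numerically. The following is a minimal sketch, assuming the covariant measurement density $|\langle t|U_{\hat\theta}^\dagger|\phi_\theta\rangle|^2/(2\pi)$; the function name `outcome_density` and the random test data are illustrative and not part of the original text.

```python
import cmath
import math
import random

def outcome_density(a, phases, theta, theta_hat):
    """Density of theta_hat for input coefficients a and the measurement
    vector |t> = sum_k exp(i * phases[k]) |k> (covariant measurement)."""
    amplitude = sum(
        a_k * cmath.exp(-1j * ph) * cmath.exp(1j * k * (theta - theta_hat))
        for k, (a_k, ph) in enumerate(zip(a, phases))
    )
    return abs(amplitude) ** 2 / (2 * math.pi)

random.seed(0)
n = 6
raw = [complex(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(n + 1)]
norm = math.sqrt(sum(abs(c) ** 2 for c in raw))
a = [c / norm for c in raw]                      # normalized input state
phases = [random.uniform(0, 2 * math.pi) for _ in range(n + 1)]

# Absorb the phases of |t> into the input state: a'_k = a_k exp(-i theta_k).
a_prime = [a_k * cmath.exp(-1j * ph) for a_k, ph in zip(a, phases)]
no_phases = [0.0] * (n + 1)                      # this is |t_0>

theta = 0.3
grid = [2 * math.pi * j / 64 for j in range(64)]
max_gap = max(
    abs(outcome_density(a, phases, theta, th)
        - outcome_density(a_prime, no_phases, theta, th))
    for th in grid
)
# Riemann sum of the density over one period (should be close to 1).
total = sum(outcome_density(a, phases, theta, th) for th in grid) * 2 * math.pi / 64
print(max_gap, total)
```

Both protocols give the same outcome density at every grid point, which is the sense in which only the input state matters.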

The main target of the present paper is to analyze the asymptotic behavior of the output distribution for a sequence of input states $\mathcal{M} := \{a^{(n)}\}$. For this purpose, we treat the distribution of the parameter $z_n := \frac{n(\hat\theta_n - \theta)}{2}$, because the estimate $\hat\theta_n$ approaches the true parameter $\theta$ with the order $\frac{1}{n}$ when an appropriate measurement and an appropriate input state are used. When the random variable $z_n$ converges to a random variable $z$ in probability, the distribution $P^{\mathcal{M}}$ of $z$ is called the limiting distribution of the sequence of input states $\mathcal{M}$. In the case of state estimation, including the classical case, if we apply a suitable estimator, the limiting distribution is the Gaussian distribution under a suitable regularity condition. In the classical case, more precisely, the asymptotically sufficient statistic for the given parameter obeys this Gaussian distribution. That is, asymptotically, any estimator is given as a function of the statistic obeying the Gaussian distribution. This Gaussian distribution is characterized only by its variance.


Even in the quantum case of state estimation, the estimation problem can be reduced to that of a family of quantum Gaussian states in the asymptotic sense |Toi [III \v\ . In particular, if we treat the estimation of a one-parameter model, we obtain the same conclusion as in the classical case. Hence, it is sufficient to evaluate the variance when considering the limiting distribution. That is, there is no variety in the limiting distribution of state estimation because the limiting distribution is uniquely determined as the Gaussian distribution. The main problem in the present paper is, on the other hand, whether there exists a variety in the limiting distribution of phase estimation.

When the cost function $R(\theta,\hat\theta)$ has the form $R(\theta,\hat\theta) = c(\theta - \hat\theta)^2 + o((\theta - \hat\theta)^2)$, the average error behaves as $D_\theta(M_{|t_0\rangle}, a^{(n)}) \cong \frac{4c}{n^2} V$, where $V$ is the variance of the limiting distribution. For example, when $R(\theta,\hat\theta) = \sin^2\frac{\theta-\hat\theta}{2}$, the constant $c$ is $\frac{1}{4}$. Hence, the analysis of the limiting distribution yields the asymptotic analysis of the average gate fidelity. Further, the analysis of the limiting distribution provides the asymptotic analysis of phase estimation from another aspect. For example, the tail probability of the sequence of input states $\mathcal{M}$ can be described as follows:

\[ P_\theta^{M_{|t_0\rangle},a^{(n)}}\Bigl\{ |\hat\theta_n - \theta| > \frac{2A}{n} \Bigr\} \to P^{\mathcal{M}}\{ |z| > A \}. \tag{6} \]



Using this relation, we can evaluate the number of applications of the unknown unitary gate required for a given allowable error width $B$ and a given allowable error probability $\epsilon$ as follows. First, we choose $A$ by $P^{\mathcal{M}}\{|z| > A\} = \epsilon$. Next, we choose $n$ by $\frac{2A}{n} = B$, i.e., $n = \frac{2A}{B}$. Thus, the required number of applications is equal to $\frac{2A}{B}$ if we use the sequence of input states $\mathcal{M}$. The above discussions clarify that the analysis of the limiting distribution yields various types of asymptotic analysis of phase estimation.

Hence, in the present paper, for a deeper and unified asymptotic analysis of phase estimation, we analyze the limiting distribution of the sequence of input states $\mathcal{M}$.
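As an illustration of the two-step recipe above, the following sketch computes $A$ and the required $n$ for one concrete limiting distribution. It assumes the density $\sin^2 y/(\pi y^2)$, which is the limiting density of the uniform input state discussed in Section III, and it uses the scaling $n = 2A/B$; the function names, the Simpson integration, and the bisection search are illustrative choices, not part of the original text.

```python
import math

def uniform_limit_density(y):
    """Limiting density sin^2(y) / (pi y^2) of the uniform input state."""
    return 1.0 / math.pi if y == 0 else math.sin(y) ** 2 / (math.pi * y * y)

def tail_probability(A, steps=4000):
    """P{|z| > A} = 1 - integral of the density over [-A, A] (composite Simpson)."""
    h = 2.0 * A / steps
    total = uniform_limit_density(-A) + uniform_limit_density(A)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * uniform_limit_density(-A + i * h)
    return 1.0 - total * h / 3.0

def required_applications(eps, B):
    """Smallest A with tail probability <= eps (by bisection), then n = 2A / B."""
    lo, hi = 0.0, 1.0
    while tail_probability(hi) > eps:
        hi *= 2.0
    for _ in range(40):
        mid = 0.5 * (lo + hi)
        if tail_probability(mid) > eps:
            lo = mid
        else:
            hi = mid
    return hi, math.ceil(2.0 * hi / B)

A, n_required = required_applications(eps=0.1, B=0.01)
print(A, n_required)
```

For this density the tail behaves roughly like $1/(\pi A)$, so a small error probability forces a large $A$; this is one motivation for looking at input states whose limiting distributions decay faster.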



III. RELATION WITH SQUARE INTEGRABLE 
FUNCTIONS 

In this section, we give a remarkable relation between limiting distributions and square integrable functions on $[-1,1]$.

Theorem 1 For any sequence of input states $\mathcal{M}$ having a limiting distribution, there exists a function $f \in L^2([-1,1])$ satisfying the normalizing condition $\int_{-\infty}^{\infty} |f(x)|^2 dx = 1$ and

\[ P^{\mathcal{M}} = P^{f}, \tag{7} \]

where $P^{f}(dy) := |F(f)(y)|^2 dy$ and $F(f)$ is the Fourier transform on $L^2(\mathbb{R})$ of $f$, i.e., $F(f)(y) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} f(x) e^{ixy} dx$.

Conversely, for a function $f \in L^2([-1,1])$ with the normalizing condition, there exists a sequence of input states $\mathcal{M}$ satisfying the condition (7).

Due to this relation, we can reduce the analysis of limiting distributions to the analysis of wave functions on the interval $[-1,1]$. That is, our problem results in Fourier analysis on the interval $[-1,1]$.
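Proof: As the first step of the proof, we construct a function $f \in L^2([-1,1])$ for a given sequence of input states $\mathcal{M} := \{a^{(n)}\}$. For this purpose, we define a function $f_n \in L^2([-1,1])$ by

\[ f_n(x) := \sqrt{\frac{n+1}{2}}\, a_k^{(n)} \quad \text{for } x \in \Bigl( x_k - \frac{1}{n+1},\, x_k + \frac{1}{n+1} \Bigr], \quad \text{where } x_k := \frac{2k-n}{n+1}. \tag{8} \]

In the following, the set of the above functions is denoted by $L^2_n$. The parameter $z_n$ can be replaced by the parameter $y_n := \frac{(n+1)(\hat\theta_n - \theta)}{2}$ because the ratio $z_n / y_n \to 1$. Since

\[ \int_{x_k - \frac{1}{n+1}}^{x_k + \frac{1}{n+1}} f_n(x) e^{ixy_n} dx = \sqrt{\frac{n+1}{2}}\, a_k^{(n)} e^{ix_k y_n}\, \frac{2}{y_n} \sin\frac{y_n}{n+1}, \tag{9} \]

we have

\[ P_\theta^{M_{|t_0\rangle},a^{(n)}}(d\hat\theta_n) = \frac{1}{\pi(n+1)} \Bigl| \sum_{k=0}^{n} a_k^{(n)} e^{i(k-\frac{n}{2})(\hat\theta_n-\theta)} \Bigr|^2 dy_n = \frac{1}{2\pi} \Bigl| \int_{-1}^{1} f_n(x) e^{ixy_n} dx \Bigr|^2 \left( \frac{\frac{y_n}{n+1}}{\sin\frac{y_n}{n+1}} \right)^{2} dy_n. \tag{10} \]

When $f_n$ goes to a function $f \in L^2([-1,1])$, since

\[ \frac{\frac{y}{n+1}}{\sin\frac{y}{n+1}} \to 1 \]

as $n$ goes to infinity, the distribution of the normalized outcome $y_n = \frac{(n+1)(\hat\theta_n-\theta)}{2}$ converges to the distribution $P^{f}(dy)$. If $f_n$ does not converge, by replacing the sequence $\{f_n\}$ by a converging subsequence $\{f_{n_k}\}$, i.e., $f_{n_k} \to f$, we can show that the distribution of the normalized outcome converges to the distribution $P^{f}(dy)$.

Next, we prove the opposite argument. For a given function $f \in L^2([-1,1])$ satisfying the normalizing condition, we construct a sequence of input states satisfying (7). There exists a sequence of functions $f_n \in L^2_n$ such that $f_n \to f$. Then, we define the coefficients $a_k^{(n)}$ by

\[ a_k^{(n)} := \sqrt{\frac{2}{n+1}}\, f_n(x_k). \]

Due to (9) and (10), the sequence of input states $\mathcal{M}_f := \{a^{(n)}\}$ satisfies (7). ■

For example, when the input state is $\sum_{k=0}^{n} \sqrt{\frac{1}{n+1}}\, |k\rangle$, the function $f$ is the constant $\sqrt{\frac{1}{2}}$. Since its Fourier transform is given by $\frac{\sin y}{\sqrt{\pi}\, y}$, the limiting distribution is described as

\[ P^{f}(dy) = \frac{\sin^2 y}{\pi y^2}\, dy. \]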
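The convergence stated in Theorem 1 can be observed numerically for the uniform input state. The sketch below assumes the normalization $y_n = (n+1)(\hat\theta_n - \theta)/2$ and the covariant measurement with vector $|t_0\rangle$; the helper names are illustrative and do not come from the original text.

```python
import cmath
import math

def finite_n_density(a, y):
    """Density of y_n = (n+1)(theta_hat - theta)/2 for input coefficients
    a_0..a_n measured with the covariant measurement built from |t_0>."""
    n = len(a) - 1
    delta = 2.0 * y / (n + 1)                    # theta_hat - theta
    amplitude = sum(a_k * cmath.exp(1j * k * delta) for k, a_k in enumerate(a))
    # (1/2pi)|amplitude|^2 d(delta), with d(delta) = 2 dy / (n+1)
    return abs(amplitude) ** 2 / (math.pi * (n + 1))

def limit_density_uniform(y):
    """|F(f)(y)|^2 for the constant function f = 1/sqrt(2) on [-1, 1]."""
    return 1.0 / math.pi if y == 0 else math.sin(y) ** 2 / (math.pi * y * y)

def max_gap(n, ys=(0.0, 0.5, 1.0, 2.5, 4.0)):
    """Largest deviation between the finite-n density and its limit on a few points."""
    a = [1.0 / math.sqrt(n + 1)] * (n + 1)
    return max(abs(finite_n_density(a, y) - limit_density_uniform(y)) for y in ys)

for n in (20, 200, 2000):
    print(n, max_gap(n))
```

The gap shrinks as $n$ grows, in line with the factor $\bigl(\frac{y/(n+1)}{\sin(y/(n+1))}\bigr)^2 \to 1$ used in the proof.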



IV. VARIANCE OF LIMITING DISTRIBUTION 

In the previous section, we have shown that limiting distributions $P^{f}$ of outcomes are acquired through Fourier transforms of wave functions $f \in L^2([-1,1])$, which correspond to the coefficients $a_k^{(n)}$ of input states. In this section, let us seek the input state minimizing the variance by utilizing this fact. As is mentioned in Section II, optimizing the first-order coefficient of the mean square error is equivalent to minimizing the variance $V(f) := \int_{-\infty}^{\infty} y^2 P^{f}(dy)$ over all functions $f \in L^2([-1,1])$.

Define the multiplication operator $Q$ and the momentum operator $P := -i\frac{d}{dx}$ on $L^2(\mathbb{R})$. Then, a function $f \in L^2([-1,1])$ satisfies

\[ V(f) = \langle f| F^{\dagger} Q^2 F |f\rangle = \langle f| P^2 |f\rangle. \]

Under the natural embedding from $L^2([-1,1])$ to $L^2(\mathbb{R})$, the minimum value of $V(f)$ is given by

\[ \min_{f \in L^2([-1,1])} \langle f| P^2 |f\rangle, \]

which is nothing but the Dirichlet problem, because the restriction of the operator $P^2$ to $L^2([-1,1])$ is equivalent to the square of the operator $-i\frac{d}{dx}$ on

\[ \{ f \in L^2([-1,1]) \cap C^1([-1,1]) \mid f(-1) = f(1) = 0 \}. \]

That is, the problem is reduced to finding the minimum eigenvalue of the operator $P^2$. Its eigenvalues are $\frac{\pi^2 m^2}{4}$ $(m = 1, 2, \ldots)$ and the corresponding eigenfunctions are $\phi_m(x) = \sin\frac{\pi m (x+1)}{2} / C_m$ [13], where $C_m$ is a normalizing constant. In particular, the minimum variance is $\frac{\pi^2}{4}$, attained by $\phi_1$.

Note that a careful treatment is required for a function $f \in L^2([-1,1])$ when the Dirichlet condition $f(-1) = f(1) = 0$ does not hold. In this case, the function $f$ has a discontinuity at $1$ or $-1$ as an element of $L^2(\mathbb{R})$. Hence, the variance $\langle f|P^2|f\rangle$ is infinite. For example, in the case of the constant function $f = \sqrt{\frac{1}{2}}$, the variance $\frac{1}{\pi}\int_{-\infty}^{\infty} \sin^2 y\, dy$ diverges. In this case, the limiting distribution is obtained with the order $\frac{1}{n}$ [1], while the mean square error goes to zero only with the order $\frac{1}{n}$, i.e., we have a quadratic speedup concerning the limiting distribution, but no quadratic speedup concerning the mean square error. This fact is closely related to the divergence of the integral $\int_{-\infty}^{\infty} \sin^2 y\, dy$.
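The variational characterization of the variance can be tried out on simple trial functions. The sketch below assumes real functions vanishing at $\pm 1$, for which $\langle f|P^2|f\rangle = \int |f'|^2 dx / \int |f|^2 dx$; the two trial functions and the helper names are illustrative choices.

```python
import math

def simpson(func, a, b, steps=2000):
    """Composite Simpson rule on [a, b] (steps must be even)."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def variance(f, df):
    """Rayleigh quotient <f|P^2|f>/<f|f> for a real f with f(-1) = f(1) = 0."""
    return simpson(lambda x: df(x) ** 2, -1.0, 1.0) / simpson(lambda x: f(x) ** 2, -1.0, 1.0)

# First Dirichlet eigenfunction: sin(pi (x+1)/2) = cos(pi x / 2).
v_sine = variance(lambda x: math.cos(math.pi * x / 2),
                  lambda x: -math.pi / 2 * math.sin(math.pi * x / 2))
# A different trial function with the same boundary values.
v_parabola = variance(lambda x: 1.0 - x * x, lambda x: -2.0 * x)

print(v_sine, math.pi ** 2 / 4)   # both close to 2.4674
print(v_parabola)                 # 2.5, slightly larger than the minimum
```

The parabola comes close to, but does not reach, the minimum value $\pi^2/4$ attained by the sine function.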
V. TAIL PROBABILITY OF LIMITING
DISTRIBUTION

In asymptotic statistics, the behavior of the tail probability of the limiting distribution is one of the most important properties [21], because it determines the performance of interval estimation and the power of one-sided (or two-sided) tests. Thus, we consider the tail probability of the limiting distribution $P^{\phi_m}$. In the i.i.d. case, the minimum tail probability and the minimum variance among limiting distributions can be realized by the same Gaussian distribution. However, in our setting, the Gaussian distribution does not minimize the variance. Hence, it is not clear whether the minimum tail probability can be attained by the same distribution as the one minimizing the variance.

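The corresponding limiting distributions are acquired through the Fourier transforms:

\[ P^{\phi_m}(dy) = |F(\phi_m)(y)|^2 dy. \]

For the normalized eigenfunctions $\phi_m(x) = \sin\frac{\pi m(x+1)}{2}$, since

\[ \int_{-1}^{1} \sin\frac{\pi m (x+1)}{2}\, e^{ixy} dx = \frac{e^{-iy}}{2i} \left[ \frac{e^{i(y+\frac{\pi m}{2})u}}{i(y+\frac{\pi m}{2})} \Bigg|_{u=0}^{2} - \frac{e^{i(y-\frac{\pi m}{2})u}}{i(y-\frac{\pi m}{2})} \Bigg|_{u=0}^{2} \right] = \frac{\pi m}{2}\, e^{-iy}\, \frac{1 - (-1)^m e^{2iy}}{\frac{\pi^2 m^2}{4} - y^2}, \]

the limiting distributions are

\[ P^{\phi_m}(dy) = \frac{\pi m^2}{4}\, \frac{1 - (-1)^m \cos 2y}{\bigl( y^2 - \frac{\pi^2 m^2}{4} \bigr)^2}\, dy. \]

Thus, the density of $P^{\phi_m}$ decreases with the order $O(y^{-4})$, so its tail probability decreases only polynomially, with the order $O(y^{-3})$. In order to improve the tail probability, we focus on the well-known fact that the Fourier transform of a rapidly decreasing function is also a rapidly decreasing function. In our problem, the support of the original wave function $f$ is included in $[-1,1]$. Under this condition, $f$ is a rapidly decreasing function if and only if $f$ is a smooth function. Note that a rapidly decreasing function does not decrease 'suddenly'. That is, the smoothness is an essential requirement. For example, the function $\phi_m$ is not smooth at $-1$ and $1$. In the following, we construct a rapidly decreasing wave function $f$ whose support is included in $[-1,1]$. In this construction, the smoothing at $-1$ and $1$ is essential.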



First, functions $g_0$, $g_1$, and $g_2$ are defined by

\[ g_0(x) := \begin{cases} e^{-\frac{1}{x}} & \text{if } x > 0 \\ 0 & \text{otherwise,} \end{cases} \qquad g_1(x) := g_0(x+1), \qquad g_2(x) := g_0(1-x). \]

Using these functions, we define a rapidly decreasing function $g_3$ whose support is included in $[-1,1]$ by

\[ g_3(x) := g_1(x) g_2(x) / C, \]

where $C$ is the normalizing constant. As is checked numerically (see Fig. 4), this function improves the tail probability.
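The smooth function $g_3$ and its variance can be evaluated numerically. The sketch below assumes $g_0(x) = e^{-1/x}$ for $x > 0$ and computes the variance as $\int |g_3'|^2 dx$ with a central finite difference; the step sizes and helper names are illustrative choices.

```python
import math

def g0(x):
    """Smooth cutoff: exp(-1/x) for x > 0 and 0 otherwise."""
    return math.exp(-1.0 / x) if x > 0 else 0.0

def g3_unnormalized(x):
    """Product g1(x) g2(x) = g0(x + 1) g0(1 - x), supported on [-1, 1]."""
    return g0(x + 1.0) * g0(1.0 - x)

def simpson(func, a, b, steps=2000):
    """Composite Simpson rule on [a, b] (steps must be even)."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

# Normalizing constant C such that the integral of g3^2 equals 1.
C = math.sqrt(simpson(lambda x: g3_unnormalized(x) ** 2, -1.0, 1.0))

def g3(x):
    return g3_unnormalized(x) / C

def g3_derivative(x, h=1e-5):
    """Central finite-difference approximation of the derivative of g3."""
    return (g3(x + h) - g3(x - h)) / (2.0 * h)

variance_g3 = simpson(lambda x: g3_derivative(x) ** 2, -1.0, 1.0)
print(variance_g3, math.pi ** 2 / 4)
```

The variance of $g_3$ is finite but larger than the minimum $\pi^2/4$, so the gain in tail behavior is paid for with a larger variance.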

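Now, we analyze the decreasing speed of the tail probability $P^{g_3}([-R,R]^c)$. The Fourier transforms of the above functions are

\[ F(g_0)(y) \cong \frac{1}{\sqrt{2}\,|y|^{\frac{3}{4}}} \exp\bigl(-\sqrt{2|y|}\bigr) \exp\Bigl( \mathrm{sgn}(y)\, i \Bigl( \sqrt{2|y|} + \frac{3\pi}{8} \Bigr) \Bigr), \]
\[ F(g_1)(y) = e^{-iy} F(g_0)(y), \]
\[ F(g_2)(y) = e^{iy} F(g_0)(-y), \]
\[ F(g_3)(y) = \frac{1}{\sqrt{2\pi}\, C}\, F(g_1) * F(g_2)(y), \]

where $F(g_1) * F(g_2)$ is the convolution of $F(g_1)$ and $F(g_2)$, and the first relation holds for sufficiently large $|y|$.

When $|y|$ is sufficiently large, $|F(g_i)(y)|^2 \cong \frac{1}{2|y|^{\frac{3}{2}}} e^{-2\sqrt{2|y|}}$ for $i = 1, 2$, i.e.,

\[ \lim_{|y| \to \infty} \frac{-1}{\sqrt{|y|}} \log |F(g_i)(y)|^2 = 2\sqrt{2}, \quad i = 1, 2. \]

Then, as is shown in Appendix B, we obtain

\[ \lim_{|y| \to \infty} \frac{-1}{\sqrt{|y|}} \log \bigl( 2\pi |F(g_1) * F(g_2)(y)|^2 \bigr) \ge 2\sqrt{2}. \tag{11} \]

Therefore, there exists a function $f$ whose support is included in $[-1,1]$ such that the tail probability of $P^{f}$ decays exponentially in $\sqrt{|y|}$. Note that the above wave function $g_3$ does not minimize the variance $V(f)$. This fact tells us that the input state minimizing the variance is not optimal concerning the tail probability of the limiting distribution. That is, the optimal input state depends on the choice of the criterion.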

Next, we consider the maximization of the probability $P^{f}([-R,R])$. For this purpose, we denote by $\Pi_R$ the natural projection from $L^2(\mathbb{R})$ to $L^2([-R,R])$. By using the operator $F_R := F^{\dagger} \Pi_R F$, this probability has the form

\[ \langle f| F_R |f\rangle. \]

That is, our aim is the following maximization:

\[ \max_{f \in L^2([-1,1]),\ \|f\| = 1} \langle f| F_R |f\rangle = \max_{g \in L^2(\mathbb{R})} \frac{\langle g| \Pi_1 F_R \Pi_1 |g\rangle}{\|g\|^2}. \]

This problem is equivalent to the calculation of the maximum eigenvalue of $\Pi_1 F_R \Pi_1$.

Slepian and Pollak [3l showed that the eigenfunction $\psi_R$ of $\Pi_1 F_R \Pi_1$ associated with the maximum eigenvalue is given as a solution of the linear differential equation

\[ \frac{d}{dx}(1-x^2)\frac{df}{dx} + \bigl(\xi(R) - R^2 x^2\bigr) f = 0, \]

which is called a prolate spheroidal wave function, where $\xi(R)$ is chosen depending on the minimum eigenvalue.

Slepian jlQ] showed that the maximum eigenvalue $\lambda(R)$ of $\Pi_1 F_R \Pi_1$ behaves as

\[ 1 - \lambda(R) \cong 4\sqrt{\pi R}\, e^{-2R} \Bigl( 1 - \frac{3}{32R} + O(R^{-2}) \Bigr) \tag{12} \]

when $R$ is sufficiently large. The numerical calculation of this minimum probability $\min_{f \in L^2([-1,1])} P^{f}([-R,R]^c)$ is given in Fig. 4. Thus, the minimum probability $\min_{f \in L^2([-1,1])} P^{f}([-R,R]^c)$ can be evaluated as

\[ \lim_{R \to \infty} \frac{-1}{R} \log \min_{f \in L^2([-1,1])} P^{f}([-R,R]^c) = 2. \]

That is, the minimum tail probability $\min_{f \in L^2([-1,1])} P^{f}([-R,R]^c)$ goes to zero with exponential rate 2. This optimal value is attained when the input state is given by the eigenfunction $\psi_R$ of $\Pi_1 F_R \Pi_1$ associated with the maximum eigenvalue $\lambda(R)$.
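The maximum eigenvalue $\lambda(R)$ can be approximated without special-function libraries. The sketch below assumes that $\Pi_1 F_R \Pi_1$ acts on $L^2([-1,1])$ as the integral operator with kernel $\sin(R(x-x'))/(\pi(x-x'))$, which follows from the unitary Fourier convention used above; the midpoint discretization, the power iteration, and all names are illustrative choices rather than the method of the original paper.

```python
import math

def sinc_kernel(R, x, xp):
    """Kernel of Pi_1 F_R Pi_1: sin(R (x - x')) / (pi (x - x'))."""
    d = x - xp
    return R / math.pi if d == 0 else math.sin(R * d) / (math.pi * d)

def max_concentration(R, grid=100, iterations=60):
    """Largest eigenvalue lambda(R) via midpoint discretization and power iteration."""
    h = 2.0 / grid
    xs = [-1.0 + (i + 0.5) * h for i in range(grid)]
    K = [[sinc_kernel(R, x, xp) * h for xp in xs] for x in xs]
    v = [1.0 / math.sqrt(grid)] * grid           # even start vector, unit norm
    lam = 0.0
    for _ in range(iterations):
        w = [sum(k_ij * v_j for k_ij, v_j in zip(row, v)) for row in K]
        lam = sum(w_i * v_i for w_i, v_i in zip(w, v))   # Rayleigh quotient
        norm = math.sqrt(sum(w_i * w_i for w_i in w))
        v = [w_i / norm for w_i in w]
    return lam

def tail_asymptotic(R):
    """Leading terms of the asymptotic expansion (12) of 1 - lambda(R)."""
    return 4.0 * math.sqrt(math.pi * R) * math.exp(-2.0 * R) * (1.0 - 3.0 / (32.0 * R))

for R in (2.0, 4.0):
    lam = max_concentration(R)
    print(R, 1.0 - lam, tail_asymptotic(R))
```

For moderate $R$ the numerical value of $1 - \lambda(R)$ and the asymptotic expression already have the same order of magnitude, and the agreement should improve as $R$ grows.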

Now, we numerically compare the functions $\phi_1$, $g_3$, $\psi_2$, and $\psi_{10}$. The density functions of the distributions $P^{\phi_1}$, $P^{g_3}$, $P^{\psi_2}$, and $P^{\psi_{10}}$ are plotted in Fig. 3. Their tail probabilities are plotted in Fig. 4. The tail probabilities $P^{\psi_2}([-y,y]^c)$ (thick dashed) and $P^{\psi_{10}}([-y,y]^c)$ (thick solid) attain the minimum tail probability $\min_{f \in L^2([-1,1])} P^{f}([-y,y]^c)$ only at $y = 2$ and $y = 10$, respectively. The distributions $P^{\phi_1}$ and $P^{\psi_2}$ concentrate in the range $[-2,2]$; however, their tail probabilities do not decrease as rapidly as those of the distributions $P^{g_3}$ and $P^{\psi_{10}}$. This comparison indicates that the optimizations of the concentration and of the tail probability are not compatible. That is, the distributions of the Fourier transforms of the functions $g_3$ and $\psi_{10}$ have a small tail probability (Fig. 3). These functions are smooth at $-1$ and $1$. That means that we have checked that the smoothness is closely related to the tail probability.

Next, we generalize this problem slightly, i.e., we maximize the probability $P^{f}([R_1,R_2])$. In this case, the maximum value coincides with that of $P^{f}([(R_1-R_2)/2, (R_2-R_1)/2])$, and its maximum is attained by the function $e^{-i\frac{R_1+R_2}{2}x}\, \psi_{\frac{R_2-R_1}{2}}(x)$. Since the function $R \mapsto \lambda(R) = \max_{f} P^{f}([-R,R])$ is a strictly monotone increasing function, the inverse function $\beta \mapsto R(\beta)$ is a strictly monotone increasing function. Thus,

\[ \min_{f \in L^2([-1,1])} \min\{ R \mid P^{f}([-R,R]) \ge \beta \} = R(\beta). \]

Further, the LHS coincides with

\[ \min_{f \in L^2([-1,1])} \min\{ R \mid P^{f}([-R+a, R+a]) \ge \beta \} \]

for any real number $a$.

FIG. 2: The wave function minimizing the variance $\phi_1$ (thin dashed), the rapidly decreasing function $g_3$ whose support is included in $[-1,1]$ (thin solid), the prolate spheroidal wave function $\psi_2$ (thick dashed), and the prolate spheroidal wave function $\psi_{10}$ (thick solid).

FIG. 3: Probability density function minimizing the variance $P^{\phi_1}$ (thin dashed), probability density function improving the tail probability $P^{g_3}$ (thin solid), probability density function $P^{\psi_2}$ (thick dashed), and probability density function $P^{\psi_{10}}$ (thick solid).



VI. INTERVAL ESTIMATION 

Now, we treat the phase estimation problem within interval estimation. In interval estimation, given a confidence coefficient $\beta$, we estimate the confidence interval $[L,U]$, to which the unknown parameter $\theta$ is guaranteed to belong with probability $\beta$. Here, since our parameter space is the torus $\mathbb{R}/2\pi\mathbb{Z}$, a careful treatment is required for the confidence interval $[L,U]$. That is, for $L, U \in [0, 2\pi)$, the confidence interval $[L,U]$ is defined as a subset of $\mathbb{R}/2\pi\mathbb{Z}$ by

\[ [L,U] := \begin{cases} [L, U] & \text{if } L \le U \\ [L, U+2\pi] & \text{otherwise,} \end{cases} \]

and its width is defined by

\[ |[L,U]| := \begin{cases} U - L & \text{if } L \le U \\ U + 2\pi - L & \text{otherwise.} \end{cases} \]

FIG. 4: Logarithm of the tail probabilities $\log P^{\phi_1}([-y,y]^c)$ (thin dashed), $\log P^{g_3}([-y,y]^c)$ (thin solid), $\log P^{\psi_2}([-y,y]^c)$ (thick dashed), $\log P^{\psi_{10}}([-y,y]^c)$ (thick solid), and logarithm of the minimum tail probability $\log \min_{f \in L^2([-1,1])} P^{f}([-y,y]^c)$ (thick dotted).

In interval estimation, the upper bound $U$ and the lower bound $L$ of the interval are chosen from the outcome $\omega$ obeying the distribution $P_\theta^{M,a}$. Since a smaller width $|[L(\omega),U(\omega)]|$ is better, we minimize the width $|[L(\omega),U(\omega)]|$ under the condition $P_\theta^{M,a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta$ for any $\theta \in [0,2\pi)$. That is, we consider

\[ \min_{U,L} \min_{M,a} \Bigl( \max_\omega \bigl\{ |[L(\omega),U(\omega)]| \,\bigm|\, P_\theta^{M,a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta,\ \forall \theta \in [0,2\pi) \bigr\} \Bigr) \]
\[ = \min_{U,L} \min_{M,a} \max_\theta \Bigl( \max_\omega \bigl\{ |[L(\omega),U(\omega)]| \,\bigm|\, P_\theta^{M,a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta \bigr\} \Bigr). \tag{13} \]

The value (13) has a mini-max form of the cost $\max_\omega \{ |[L(\omega),U(\omega)]| \mid P_\theta^{M,a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta \}$, which has a covariant form. Thus, we can restrict our measurement to the covariant measurement $M_{|t_0\rangle}$. Hence, our problem is reduced to

\[ \min_{U,L,a} \Bigl( \max_\omega \bigl\{ |[L(\omega),U(\omega)]| \,\bigm|\, P_\theta^{M_{|t_0\rangle},a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta \bigr\} \Bigr). \]

However, since it is quite difficult to treat this optimization with a finite $n$, we treat the following asymptotic setting:

\[ \lim_{n \to \infty} \frac{n}{2} \min_{U,L,a} \Bigl( \max_\omega \bigl\{ |[L(\omega),U(\omega)]| \,\bigm|\, P_\theta^{M_{|t_0\rangle},a}\{\omega \mid \theta \in [L(\omega),U(\omega)]\} \ge \beta \bigr\} \Bigr) \]
\[ = \min_{U,L,f} \max_\omega \bigl\{ |[L(\omega),U(\omega)]| \,\bigm|\, P^{f}\{\omega \mid 0 \in [L(\omega),U(\omega)]\} \ge \beta \bigr\} \]
\[ = \min_{f \in L^2([-1,1])} \min\{ 2R \mid P^{f}([-R,R]) \ge \beta \} = 2R(\beta). \]

This optimal value is attained when the input state is constructed from the wave function $\psi_{R(\beta)}$ and the measurement is given by the covariant measurement $M_{|t_0\rangle}$ with the vector $|t_0\rangle$. That is, there exists a pair of functions $U$ and $L$ such that $|[L(\omega),U(\omega)]| \le 2R(\beta)$ and $P^{\psi_{R(\beta)}}\{\omega \mid 0 \in [L(\omega),U(\omega)]\} \ge \beta$.



VII. CONTINUOUS CASE WITH SINGLE 
COPY 

Let us consider phase estimation in the continuous case with a single copy, in which, by inputting the wave function $f$, we estimate the parameter $\theta$ in the group-covariant model $\rho_\theta = e^{i\theta Q} |f\rangle\langle f| e^{-i\theta Q}$ on the space $L^2(\mathbb{R})$.

It is known that when the shift-covariance condition is assumed for estimators, our estimator is restricted to the measurement of the observable $P$ [23]. Then, the outcome $\hat\theta$ obeys the distribution $P^{f}$, and the variance of the outcome is given by $\langle f| \Delta P^2 |f\rangle$, which is abbreviated as $\langle \Delta P^2 \rangle$.

If we can input any wave function $f$, the variance can be made arbitrarily small. Hence, it is natural to assume a constraint on the input wave function $f$. Here, we assume that the potential is given as a monotone function of the absolute value $|Q|$. While a constraint on the average potential is often assumed, we consider a deterministic condition on the potential. That is, the wave packet of $f$ is assumed to exist only in the region where the potential is less than a given constant. In the following, for simplicity of our analysis, we assume that the input wave function belongs to $L^2([-1,1])$. Hence, the discussion in Sections IV and V can be applied to this problem.

Here, it is meaningful to consider the relation with the Cramér-Rao bound. It is known in general that the Fisher information $J_\theta$ for a group-covariant model $e^{i\theta Q} |f\rangle\langle f| e^{-i\theta Q}$ is given by $J_\theta = 4\langle \Delta Q^2 \rangle$, because the symmetric logarithmic derivative (SLD) is determined by $Q - \langle Q \rangle$.

Since the operator $P$ has a commutation relation with $Q$, we have the Heisenberg limit $\langle \Delta P^2 \rangle \langle \Delta Q^2 \rangle \ge 1/4$, which is equivalent to the Cramér-Rao inequality:

\[ \langle \Delta P^2 \rangle \ge \frac{1}{4 \langle \Delta Q^2 \rangle} = J_\theta^{-1}. \]

In particular, the above inequality is attained if and only if $f$ is a squeezed state satisfying $\langle \Delta P^2 \rangle = c$ and $\langle \Delta Q^2 \rangle = \frac{1}{4c}$, because its attainability is equivalent to that of $\langle \Delta P^2 \rangle \langle \Delta Q^2 \rangle \ge 1/4$. Thus, if $f$ is not a squeezed state, the Cramér-Rao lower bound $J_\theta^{-1}$ cannot be attained uniformly in the one-copy case. As is shown in the next section, our asymptotic case is essentially equivalent to the above group-covariant model under the restriction $\mathrm{supp}\, f \subset [-1,1]$.
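The gap between the variance and the Cramér-Rao bound can be made concrete for the variance-minimizing wave function. The sketch below assumes $f = \phi_1$, i.e. $f(x) = \sin\frac{\pi(x+1)}{2} = \cos\frac{\pi x}{2}$ on $[-1,1]$, and evaluates $\langle \Delta P^2 \rangle \langle \Delta Q^2 \rangle$ by numerical integration; the helper names are illustrative.

```python
import math

def simpson(func, a, b, steps=2000):
    """Composite Simpson rule on [a, b] (steps must be even)."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0

def phi1(x):
    """First Dirichlet eigenfunction on [-1, 1], already normalized."""
    return math.cos(math.pi * x / 2)

def dphi1(x):
    return -math.pi / 2 * math.sin(math.pi * x / 2)

# <Delta P^2>: the mean of P vanishes for a real wave function.
var_P = simpson(lambda x: dphi1(x) ** 2, -1.0, 1.0)
# <Delta Q^2> = <Q^2> - <Q>^2.
mean_Q = simpson(lambda x: x * phi1(x) ** 2, -1.0, 1.0)
var_Q = simpson(lambda x: x * x * phi1(x) ** 2, -1.0, 1.0) - mean_Q ** 2

product = var_P * var_Q
print(var_P, var_Q, product)   # product = pi^2/12 - 1/2, about 0.3225
```

The product exceeds $1/4$, so for this wave function the variance $\pi^2/4$ is strictly larger than the Cramér-Rao value $1/(4\langle \Delta Q^2 \rangle)$.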



VIII. ASYMPTOTIC CRAMER-RAO LOWER
BOUND

Now, we consider the relation of our discussion with the Cramér-Rao lower bound, because the Cramér-Rao approach is often employed in the asymptotic estimation of the unknown unitary. When we apply the sequence of protocols $\mathcal{M} := \{a^{(n)}\}$, phase estimation can be treated as the estimation problem in the state family $\{ |\phi_{\theta,n}\rangle\langle\phi_{\theta,n}| \mid \theta \in [0, 2\pi] \}$, where $|\phi_{\theta,n}\rangle = \sum_{k=0}^{n} a_k e^{ik\theta} |k\rangle$. Let us calculate the SLD Fisher information. From the group covariance of the output state, it suffices to calculate the SLD Fisher information at $\theta = 0$. Let $|l_{\theta,n}\rangle := (1 - |\phi_{\theta,n}\rangle\langle\phi_{\theta,n}|) \frac{d}{d\theta} |\phi_{\theta,n}\rangle$. The SLD Fisher information $J_{0,n}$ is given by

\[ \frac{1}{4} J_{0,n} = \langle l_{0,n} | l_{0,n} \rangle = \langle \phi'_{0,n} | \phi'_{0,n} \rangle - |\langle \phi'_{0,n} | \phi_{0,n} \rangle|^2 = \sum_{k=0}^{n} k^2 |a_k|^2 - \Bigl( \sum_{k=0}^{n} k |a_k|^2 \Bigr)^2, \]

where $|\phi'_{0,n}\rangle = \frac{d}{d\theta} |\phi_{\theta,n}\rangle \big|_{\theta=0}$. Choosing a smooth function $f_n$ by (8), we have

\[ \frac{J_{0,n}}{4(n+1)^2} = \frac{1}{4} \Bigl[ \sum_{k=0}^{n} x_k^2 |a_k|^2 - \Bigl( \sum_{k=0}^{n} x_k |a_k|^2 \Bigr)^2 \Bigr]. \]

When $f_n$ converges to $f$,

\[ \lim_{n \to \infty} \frac{J_{0,n}}{4(n+1)^2} = \frac{1}{4} \Bigl[ \int_{-1}^{1} x^2 |f(x)|^2 dx - \Bigl( \int_{-1}^{1} x |f(x)|^2 dx \Bigr)^2 \Bigr] = \frac{1}{4} \langle f| \Delta Q^2 |f\rangle. \]

Since the variance of the limiting distribution is $\langle f| \Delta P^2 |f\rangle$, we obtain the limiting distribution version of the Cramér-Rao inequality as

\[ \langle f| \Delta P^2 |f\rangle \ge \frac{1}{4 \langle f| \Delta Q^2 |f\rangle} = \frac{1}{4 \lim_{n \to \infty} \frac{J_{0,n}}{(n+1)^2}}. \]

The equality holds if and only if the wave function $f$ is a squeezed state. However, since the support of $f$ belongs to $[-1,1]$, the equality cannot be attained. This fact indicates that the Cramér-Rao approach does not yield an attainable bound in the estimation of a unitary action even in the asymptotic formulation, while this approach generally yields an attainable bound in the estimation of quantum states. This point is the essential difference between state estimation and unitary estimation.

IX. CONCLUSION



As a unified approach to the asymptotic analysis of phase
estimation, we have treated the limiting distribution of the
sequence of estimators, because the various asymptotic performance
measures of estimation protocols can be recovered from the
limiting distribution.

As the first step, we have found a one-to-one correspondence
between a limiting distribution and a wave function in
$L^2([-1, 1])$. That is, we have shown that any limiting
distribution is given by the absolute square of the Fourier
transform of a wave function $f \in L^2([-1, 1])$. Due to this
correspondence, it is sufficient to optimize the distribution
given as the absolute square of the Fourier transform of a wave
function in $L^2([-1, 1])$.
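
A minimal numerical sketch of this correspondence is given below. It assumes the convention $F(f)(y) = (2\pi)^{-1/2}\int_{-1}^{1} f(x)e^{-ixy}dx$ and uses $f(x) = \cos(\pi x/2)$ as an illustrative unit-norm wave function. The convention, the test function, the grids, and the helper names `trapz` and `limiting_density` are assumptions for this example only.

```python
import numpy as np

def trapz(values, grid):
    """Composite trapezoid rule along the last axis."""
    return np.sum(0.5 * (values[..., 1:] + values[..., :-1]) * np.diff(grid), axis=-1)

def limiting_density(f_vals, x, y):
    """Return |F(f)(y)|^2 for samples f_vals of f on the grid x.

    Assumed convention: F(f)(y) = (2 pi)^(-1/2) * integral of f(x) exp(-i x y) dx.
    """
    kernel = np.exp(-1j * np.outer(y, x))      # shape (len(y), len(x))
    ft = trapz(kernel * f_vals, x) / np.sqrt(2.0 * np.pi)
    return np.abs(ft) ** 2

x = np.linspace(-1.0, 1.0, 801)
f_vals = np.cos(np.pi * x / 2.0)               # unit L^2 norm on [-1, 1]
y = np.linspace(-40.0, 40.0, 2001)
density = limiting_density(f_vals, x, y)

# By the Parseval identity the density integrates to the squared norm of f,
# up to the mass outside the truncated y range.
total_mass = trapz(density, y)
print(total_mass)
```

Because the density of this example decays only like $y^{-4}$, a small amount of mass lies outside any finite window, so `total_mass` is slightly below one.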

As the next step, the minimization of the variance has been
treated among the above distributions by treating the Dirichlet
problem in a similar way to Buzek et al.

We have also considered the tail probability. In order to
guarantee a small error probability outside a given interval, the
limiting distribution should decrease rapidly. However, it has
been clarified that the limiting distribution minimizing the
variance is not rapidly decreasing. In order to construct a
rapidly decreasing limiting distribution, we have employed a
smoothing method that yields a function whose support is included
in $[-1, 1]$ and whose Fourier transform is rapidly decreasing.
It has been numerically checked that this function improves the
tail probability remarkably.

Further, the tail probability for a given interval has been
minimized among these limiting distributions by employing
Slepian and Pollak's analysis of signal processing [15]. The
optimal limiting distribution depends on the width of this
interval. Using this optimization, we have treated interval
estimation in the asymptotic setting.
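
The dependence on the interval width can be illustrated numerically. For $f$ supported on $[-1,1]$ and the convention $F(f)(y) = (2\pi)^{-1/2}\int f(x)e^{-ixy}dx$ (assumed here), the probability that $|F(f)|^2$ assigns to $[-R, R]$ is the quadratic form of the kernel $\sin(R(x-x'))/(\pi(x-x'))$, so the smallest tail probability is one minus the largest eigenvalue of the corresponding integral operator on $[-1,1]$. The sketch below approximates that eigenvalue with a Nystrom discretization on Gauss-Legendre nodes. The function name `concentration_matrix`, the node count, and the value $R = 2$ are illustrative assumptions, and this is not claimed to be the authors' numerical procedure.

```python
import numpy as np

def concentration_matrix(R, m=64):
    """Symmetrized Nystrom matrix of the kernel sin(R(x-x'))/(pi(x-x')) on [-1, 1]."""
    x, w = np.polynomial.legendre.leggauss(m)   # Gauss-Legendre nodes and weights
    d = x[:, None] - x[None, :]
    # np.sinc(t) = sin(pi t)/(pi t), so this equals sin(R d)/(pi d), with R/pi at d = 0.
    kernel = (R / np.pi) * np.sinc(R * d / np.pi)
    sw = np.sqrt(w)
    return x, w, sw[:, None] * kernel * sw[None, :]

R = 2.0                                         # illustrative half-width of the interval
x, w, A = concentration_matrix(R)

# Largest eigenvalue: maximal probability inside [-R, R] over all admissible f.
lam_max = np.linalg.eigvalsh(A)[-1]
tail_opt = 1.0 - lam_max

# Probability inside [-R, R] for the illustrative choice f(x) = cos(pi x / 2).
v = np.sqrt(w) * np.cos(np.pi * x / 2.0)
conc_cos = v @ A @ v / (v @ v)
tail_cos = 1.0 - conc_cos

print(tail_opt, tail_cos)
```

Repeating the computation for several values of `R` shows how the optimal tail probability and the maximizing eigenvector change with the width of the interval.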

Next, we have treated the relation with phase estimation in the
continuous system in the one-copy setting. In this case,
Heisenberg's uncertainty relation is equivalent to the Cramer-Rao
inequality. Using this relation, we have obtained the condition
for attainability of the Cramer-Rao inequality. Further, we have
applied this relation to the asymptotic analysis of the variance
of the phase estimation. Then, we have clarified that the
Cramer-Rao bound cannot be attained in our framework.

Throughout these discussions, it has been clarified that the
optimization of asymptotic phase estimation cannot be
characterized by a single parameter, whereas the state estimation
of a single-parameter model with a regularity condition can be
characterized by a single parameter, i.e., the variance, due to
the asymptotic normality [13, 14]. This property is the biggest
difference from state estimation.

Indeed, a similar property can be expected in a general 
unitary estimation. It is a future problem to investigate 
the limiting distribution in the estimation of unitary op- 
eration in a more general case. 



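APPENDIX A: ELIMINATION OF MULTIPLICITY

The unitary $V_\theta^{\otimes n}$ can be written in the form

$$V_\theta^{\otimes n} = \sum_{k=0}^{n}\sum_{j=1}^{m_k} e^{i(k-n/2)\theta}\,|k,j\rangle\langle k,j|$$

with the multiplicity $m_k = \binom{n}{k}$. When this unitary acts on
the input state $\sum_{k=0}^{n}\sum_{j=1}^{m_k} a_{k,j}|k,j\rangle$, the
final state is given by

$$\sum_{k=0}^{n}\sum_{j=1}^{m_k} e^{i(k-n/2)\theta}a_{k,j}|k,j\rangle
= \sum_{k=0}^{n} e^{i(k-n/2)\theta}\Big(\sum_{j=1}^{m_k} a_{k,j}|k,j\rangle\Big)
= \sum_{k=0}^{n} e^{i(k-n/2)\theta}a_k|k\rangle,$$

where $a_k := \sqrt{\sum_{j=1}^{m_k}|a_{k,j}|^2}$ and
$|k\rangle := \frac{1}{a_k}\sum_{j=1}^{m_k} a_{k,j}|k,j\rangle$. Then, the
estimation problem with multiplicity can be reduced to that of
$U_\theta$ given in (1).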
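
The reduction can be checked numerically with a small sketch. It draws random amplitudes $a_{k,j}$ with multiplicities $\binom{n}{k}$, forms $a_k$ as the norm of each block, and confirms that overlaps between output states at two phases agree in the full and the reduced description. The helper names, the choice $n = 6$, the random amplitudes, and the eigenphase $e^{i(k-n/2)\theta}$ are assumptions for illustration. Any phase that depends only on $k$ gives the same conclusion.

```python
import math
import numpy as np

def reduce_multiplicity(amps):
    """amps[k] holds the amplitudes a_{k,j}; return a_k = sqrt(sum_j |a_{k,j}|^2)."""
    return np.array([np.linalg.norm(block) for block in amps])

def overlap_full(amps, theta1, theta2, n):
    """<psi(theta1)|psi(theta2)> computed in the full space with multiplicities."""
    total = 0.0 + 0.0j
    for k, block in enumerate(amps):
        # Assumed eigenphase exp(i (k - n/2) theta), identical for every j.
        phase = np.exp(1j * (k - n / 2.0) * (theta2 - theta1))
        total += phase * np.vdot(block, block)
    return total

def overlap_reduced(a, theta1, theta2, n):
    """The same overlap computed from the reduced amplitudes a_k only."""
    k = np.arange(len(a))
    return np.sum(a ** 2 * np.exp(1j * (k - n / 2.0) * (theta2 - theta1)))

rng = np.random.default_rng(0)
n = 6
amps = [rng.normal(size=math.comb(n, k)) + 1j * rng.normal(size=math.comb(n, k))
        for k in range(n + 1)]
total_norm = math.sqrt(sum(np.vdot(b, b).real for b in amps))
amps = [b / total_norm for b in amps]           # normalize the input state

a = reduce_multiplicity(amps)
full = overlap_full(amps, 0.3, 1.1, n)
reduced = overlap_reduced(a, 0.3, 1.1, n)
print(abs(full - reduced))
```

Since all overlaps between output states coincide, the two state families are unitarily equivalent, which is the sense in which the multiplicity can be eliminated.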



APPENDIX B: PROOF OF (11)

Now, we prove (11). Assume that $y > 0$. For a given integer $N$,

$$\Big|\int_{-\infty}^{\infty} F(g_1)(y')\,F(g_2)(y-y')\,dy'\Big|
\le \int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy'
+ \int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy'
+ \int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy'.$$

Since $|F(g_1)(y')|$ is bounded,

$$\int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy'
= O\Big(\int_{\cdots} |F(g_2)(y-y')|\,dy'\Big) = O(e^{-\cdots}).$$

Similarly,

$$\int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy' = O(e^{-\cdots}).$$

Further,

$$\int_{\cdots} |F(g_1)(y')|\cdot|F(g_2)(y-y')|\,dy'
\le \cdots \max_{y'} |F(g_1)(y')|\cdot|F(g_2)(y-y')|.$$

Therefore,

$$\Big|\int_{-\infty}^{\infty} F(g_1)(y')\,F(g_2)(y-y')\,dy'\Big| \le O(e^{-\cdots}).$$

Since $N$ is arbitrary,

$$\Big|\int_{-\infty}^{\infty} F(g_1)(y')\,F(g_2)(y-y')\,dy'\Big| \le O(e^{-\cdots}).$$

Taking the square, we obtain (11).

In the case of $y < 0$, we can show (11) by replacing $y'$ by $y - y'$.